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(57) Abstract 

A biologically active peptide is prepared 
by: a) preparing an expression vector compris- 
ing a DNA cassette containing two or more tan- 
dem repeating units of a nucleotide sequence en- 
coding a biologically active peptide and a linker 
peptide attached thereto, the linker peptide be- 
ing cleavable by a protease or a chemical agent; 
b) transforming a microorganism with the ex- 
pression vector, c) culturing the transformed mi- 
croorganism to produce a multimeric peptide ex- 
pressed by the DNA cassette; d) recovering and 
digesting the multimeric peptide with the pro- 
tease or the chemical agent to obtain the biolog- 
ically active peptide or an analog thereof carry- 
ing one or more amino acid residues originating 
from the linker peptide. 
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DNA CASSETTE ENCODING A MULTIMER OF A BIOLOGICALLY ACTIVE 
. PEPTIDE AND A CLEAVABLE LINKER ATTACHED THERETO AND 
PROCESS FOR PREPARING THE BIOLOGICALLY ACTIVE PEPTIDE 



5 FIELD OF THE INVENTION 



10 



15 



The present invention relates to a DNA cassette 
comprising two or more tandem repeating units of a 
nucleotide sequence encoding a biologically active peptide 
and a cleavable linker peptide attached thereto, an 
expression vector containing the DNA cassette, a 
microorganism transformed by the expression vector, and a 
process for preparing the biologically active peptide using 
the transformed microorganism. 



BACKGROUND OF THE INVENTION 



Many biologically active peptides such as enzymes and 
hormones have been produced by chemical synthesis or by in 
20 vivo synthesis using genetically engineered host 
microorganisms . 

The chemical synthesis method has been employed mainly 
for the production of very short peptides comprised of 2 to 
10 amino acid residues. This chemical method is, however, 
25 ineffective and uneconomical when applied to the production 
of peptides having more than 10 amino acid residues. 

The method of producing biologically active peptides in 
genetically engineered host microorganisms, on the other 
hand, has been widely used for the production of long 
30 polypeptides having 100 to 400 amino acid residues, e.g., 
growth hormones, insulin and restriction enzymes . However, 
this method is not suitable for preparing shorter peptides 
with less than 100 amino acid residues because of its 
inherent low productivity. 
35 Accordingly, there has existed a need to develop an 

economical process for preparing short biologically active 
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peptides comprising 10 to 100 amino acid residues. 
SUMMARY OF THE INVENTION 

5 It is, therefore, an object of the present invention to 

provide a novel DNA cassette containing multiple copies of 
a nucleotide sequence encoding a short peptide, an 
expression vector comprising the DNA cassette, a 
microorganism transformed by the vector, and a process for 

10 preparing the peptide using the transformed microorganism. 

In accordance with one aspect of the present invention, 
there is provided a DNA cassette which comprises two or more 
tandem repeating units of a nucleotide sequence encoding a 
biologically active peptide and a linker peptide attached 

15 thereto, said linker peptide being cleavable by a protease 
or a chemical agent. 



RRIEF DESCRIPTION OF THE DRAWINGS 

20 The above objects and features of the present invention 

will become apparent from the following description of 
preferred embodiments taken in conjunction with the 
accompanying drawings, in which: 

Fig- 1 shows the DNA fragment obtained by annealing 
25 oligonucleotides 1 and 2; 

Fig. 2 shows the procedure of designing the basic DNA 

unit . 

Fig. 3 depicts the left adaptor obtained by annealing 
oligonucleotides 3 and 4; 
30 Fig. 4 represents the right adaptor obtained by 

annealing oligonucleotides 5 and 6; 

Fig. 5 reproduces the electrophoresis result of the 174 

bp DNA cassette; 

Fig. 6 presents the DNA fragment obtained by annealing 

35 oligonucleotides 7; 

Fig. 7 discloses the synthetic DNA fragment obtained by 
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annealing oligonucleotides 8 and 9; 

Fig. 8 describes the procedure for constructing vector 
pRSET-B-cGnRH-II' -4R*; 

Fig. 9 provides the nucleotide sequence of the DNA 
5 cassette contained in vector pRSET-B-cGnRH-II ' -4R*; 

Fig. 10 offers the SDS-PAGE result of the 7.3 kDa 
protein expressed by vector pRSET-B-cGnRH-II ' -4R*; and 

Fig. 11 discloses the electrophoresis result of the 
peptides obtained by digesting the 7.3 kDa protein with 
10 trypsin. 

DETAILED DESCRIPTION OF THE INVENTION 

A DNA cassette of the present invention comprises two 
15 or more tandem repeating units of a nucleotide sequence 
encoding a biologically active peptide and a linker peptide 
attached thereto, said linker peptide being cleavable by a 
protease or a chemical agent. 

The biologically active peptide which may be used in 
20 the present invention includes any biologically active 
peptide comprised of 2 to 200 amino acids, preferably 10 to 
100 amino acids. Exemplary peptides include 

adrenocorticotropic hormone, angiotensin, atrial natriuretic 
peptide, bradykinin, . chemotactic hormone, dynorphin, 
25 endorphin, enkephalin, fibronectin, S-lipotropin fragments, 
gastrointestinal peptide, growth hormone releasing peptide, 
luteinizing hormone releasing hormone such as gonadotropin 
releasing hormone, melanocyte stimulating hormone, 
neurotensin, opioid peptide, oxytocin, vasopressin, 
30 vasotocin, parathyroid hormone, somatostatin, substance P 
and antigenic peptide. 

The linker peptide may have an amino acid residue or an 
amino acid sequence which is specifically recognized and 
digested by a protease or a chemical agent, thereby 
35 providing one or more cleavage sites. Many protease- or 
chemical agent -digestible amino acid links and amino acid 
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sequences are well known in the art of protein biochemistry; 
and. representative examples thereof are shown in Table I. 



Table I 



Protease or Chemical 
Agent 


Digestible Amino Acid or 
Amino Acid Sequence 


Protease 


Trypsin 


Lys A or Arg A 


Asparaginyl- 
endopeptidase 


Asn A 


Arginyl- 
endopeptidase 


Arg A 


Achromobacter 
Protease I 


Lys 1 


StaDhvlococcus 
aureus V8 
Protease 


Glu A 


rTEV Protease 


Glu-Asn-Leu-Tyr-Phe-Gln A -Gly 


Factor Xa 
Protease 


Ile-Glu-Gly-Arg 1 


Chemical 
Agent 


Cyanogen 
bromide 


Met A 



10 

* A represents a peptide bond digested by a protease or 
chemical agent. 

Using one or more digestible amino acid links and amino 
15 acid sequences as set forth above, various linker peptides 
may be designed as shown in Table II. 
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Table II 



Protease or Chemical Agent 


Linker Peptide 


Trypsin 


Lys 






Arg 






Lys - Arg 






Arg - J-iys 






jjys- 






(X) -Lys 






Arg- X) 






X) -Arg 






(X) n -Arg-Lys 






Lys -Arg- (X) n 






Arg-Lys- (X) n 






(X) n -Lys-Arg-(X) n 






(X) n -Arg-Lys-(X) n 






W -Arg- vaj L»ys 






(X) n -Lys-(X) n -Arg 






(X) n -Lys-(X) n -Arg 






(X)„-Arg- (X)„-Lys 


■(». 


Factor Xa Protease 


Ile-Glu-Gly-Arg 






Ile-Glu-Gly-Arg- 


(x) n 




(X) -Ile-Glu-Gly- 


Arg 




(X)„-Ile-Glu-Gly- 


Arg-(X)„ 


Cyanogen Bromide 


Met 




(X)-Met 






Met- (X) n 






(X^-Met- (X) n 





* X is Ala, Arg, Asn, Asp, Cys, Glu, Gin, Gly, His, He, 
Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; and n is 
an integral number. 

10 

A linker peptide attached to a biologically active 
peptide is used as a basic repeating unit to prepare a 
tandem repetitive multimeric peptide which yields multiple 
copies of the biologically active peptide when digested by 
15 a protease or a chemical agent. 

A biologically active peptide having a digestible C- 
terminal amino acid residue, e.g., lysine, arginine, 
asparagine, glutamine and methionine, may also be used in 
the present invention as a basic repeating unit. 
20 A nucleotide sequence encoding a basic repeating unit 

may be tandemly linked together to form a DNA cassette 
encoding a multimeric peptide. 

The DNA cassette encoding the multimeric peptide may be 
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obtained by ligating a basic DNA unit encoding the above 
mentioned basic repeating unit in accordance with a 
conventional ligation method (Sambrook et al., Molecular 
Cloning: A Laboratory Manual , 2nd ed. , Cold Spring Harbor 

5 Laboratory Press, USA(1989)). 

The basic DNA unit may be any of 5' -cohesive ended 
form, 3 '-cohesive ended form and blunt-ended form. Among 
those, the 5' -cohesive ended form and 3 '-cohesive ended form 
are preferred because of their ability to undergo a 

10 directional ligation to generate a DNA cassette having 
directional tandem repeats thereof. Particularly preferred 
is a DNA fragment having a double stranded DNA region 
encoding a target peptide and complementary single strand 
DNA sequences of a linker peptide protruding from 5' - or 3' - 

15 ends thereof. Such basic DNA unit may be obtained by 
chemically synthesizing a pair of complementary 
oligonucleotides encoding the basic repeating unit and 
annealing the oligonucleotides in accordance with a 
conventional annealing method (Sambrook et al., vide supra) . 

20 The nucleotide sequence of a basic DNA unit encoding a 

basic repeating unit may be deduced from the amino acid 
sequences of the target peptide and the linker peptide 
contained in the basic repeating unit based on the standard 
genetic code. 

25 The nucleotide sequence encoding a basic repeating unit 

may be further modified without changing the amino acid 
sequence so as to minimize the free energy of the secondary 
structure of RNA transcribed therefrom in case of N-terminal 
basic repeating unit, thereby increasing the expression 
30 efficiency thereof; or to lower the nucleotide homology 
between repeating nucleotide sequences; or to provide one or 
more preferred codons. 

Therefore, the basic DNA unit which may be used in the 
ligation reaction has a nucleotide sequence encoding a 
35 repeating unit and various modified forms thereof. 

The DNA cassette of the present invention may be 
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obtained by ligating in tandem a suitable number of basic 
DNA units encoding the basic repeating unit, the number 
ranging from 2 to 20, preferably 2 to 10. Such cassette 
encodes a multimeric peptide containing multiple copies of 
5 the basic repeating unit in tandem; and it may further 
contain an additional nucleotide sequence encoding a stuffer 
amino acid in the linker peptide region to modify the charge 
or hydrophobic ity of the multimeric peptide. 

The vector of the present invention, which is capable 
10 of expressing a multimeric peptide, may be obtained by 
inserting the DNA cassette containing a nucleotide sequence 
encoding a multimeric peptide in an expression vector 
according to a conventional cloning method (Sambrook et al . , 
vide supra ) . In this process, suitable adaptors may be 
15 attached to the DNA cassette in order to facilitate the 
cloning thereof before the insertion. 

The expression vector thus obtained may be introduced 
into a host microorganism such as Ej_ coli , in accordance 
with a conventional transformation method (Sambrook et al., 
20 the supra ) . 

A biologically active peptide or an analog thereof may 
be efficiently prepared by (a) culturing the transformed 
microorganism to produce a multimeric peptide expressed by 
the DNA cassette; (b) recovering the multimeric peptide from 
25 the culture; and (c) digesting the multimeric peptide with 
a protease or a chemical agent to obtain the biologically 
active peptide or the analog thereof, carrying one or more 
amino acid residues originating from the linker peptide. 

The transformed host microorganism may be cultured in 
30 accordance with a conventional method (Sambrook et al., the 
supra ) to express the multimeric peptide. The expression 
may be induced using an appropriate inducer, e.g., 
isopropyl - 1 - thio-fc-D-galactoside . 

The microorganism cell culture may be centrifuged or 
35 filtered to harvest microorganism cells. The cells are then 
lysed; and the multimeric peptide may be isolated therefrom 
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by a conventional method ( Methods in Enzymology , volume 182, 
Edited by Murray D. Deutscher, Academic press, Inc., New 
York, USA), e.g., ion exchange chromatography or gel 
filtration column chromatography, preferably anion exchange 
5 fast protein liquid chromatography, and, more preferably, 
Mono-Q anion exchange fast liquid chromatography (Pharmacia 
LKB, Sweden) . 

The purified multimeric peptide may be digested with an 
appropriate protease or chemical agent in accordance with a 
10 conventional digestion method ( Methods in Enzymology , vide 
supra ) to obtain a biologically active peptide, or an analog 
having one or more linker peptide -derived amino acids 
attached to N- or C-terminus thereof. 

The following Preparation Examples and Examples are 
15 intended to further illustrate the present invention without 
limiting its scope. 

Preparation Example 1 : Preparation of Basic DNA Unit 

20 A trypsin-digestible linker peptide represented by 

following amino acid sequence (SEQ ID NO: 1) was designed. 

Gly Lys Arg 

25 Based on the amino acid sequences of chicken 

gonadotropin releasing hormone II (designated cGnRH-II; 
Miyamoto et al., Proc. Natl. Acad. Sci . USA , 81, 3874-3878 
(1984)) and the above linker peptide, a basic peptide unit 
represented by following amino acid sequence was designed. 

30 This unit, designated cGnRH-II' , has the structure of linker 
peptide-cGnRH- II -linker peptide. 



35 



cGnRH-II (SEQ ID NO: 2): 

Glu His Trp Ser His Gly Trp Tyr Pro Gly 

5 10 
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cGnRH-II' (linker peptide-cGnRH- II -linker peptide) (SEQ ID NO: 
3) : 

Gly Lys Arg Glu His Trp Ser His Gly Trp Tyr Pro Gly Gly Lys 

5 10 15 

5 . Arg 

Using the standard genetic codes, a DNA fragment having 
the following nucleotide sequence (SEQ ID NO: 4) was deduced 
to match cGnRH-II' . 

10 

5 ' -GGGAAGAGAGAGCACTGGTCCCACGGGTGGTACCCAGGCGGGAAGAGA- 3 ' 

To obtain a basic DNA unit encoding cGnRH-II', the 
following pair of complementary oligonucleotides were 

15 synthesized and annealed to obtain a basic DNA unit shown in 
Fig. 1 which consists of a double stranded (ds) region 
encoding cGnRH-II and two single stranded (ss) regions 
protruding from both 3' ends thereof encoding the linker 
peptide, the two single strands being complementary to each 

20 other. 

sense strand oligonucleotide 1 (SEQ ID NO: 5) : 
5 ' - GAGCACTGGTCCCACGGGTGGTACCCAGGCGGGAAGAGA- 3 ' 
antisense strand oligonucleotide 2 (SEQ ID NO: 6) : 
25 5 ' - GCCTGGTGACCACCCGTGGGACCAGTGCTCTCTCTTCCC - 3 ' 

Fig. 2 shows the procedure of designing the basic DNA 

unit . 

This basic DNA unit was employed as a monomer in the 
30 preparation of a DNA cassette encoding a multimeric peptide 
composed of repeating units of (cGnRH-II- linker peptide) . 

Preparation Example 2 : Preparation of Left Adaptor 



35 For preparing a left adaptor useful for cloning the DNA 

fragment obtained in Preparation Example 1, the following 
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pair of complementary oligonucleotides were synthesized and 
annealed to obtain a left adaptor shown in Fig. 3 which 
consists of a 5' -protruding ss region having EcoRI site, a 
ds region having BamHI and Xmal/Smal sites, and a 3'- 
5 protruding ss region having a nucleotide sequence encoding 
the linker peptide. 

oligonucleotide 3 (SEQ ID NO: 8) : 
5 ' -AATTCAAGGATCCCCCGGGGGGAAGAGA-3 ' 
10 oligonucleotide 4 (SEQ ID NO: 9) : 
5 ' - CCCGGGGGATTCTTG - 3 ' " 

Preparation Example 3 : Preparation of Right Adaptor 

15 The procedure of Preparation Example 2 was repeated 

except for using the following pair of complementary 
oligonucleotides, to obtain a right adaptor shown in Fig. 4 
which consists of a 3 ' -protruding ss region having a 
nucleotide sequence complementary to that the encoding 

20 linker peptide, a ds region having Xhol and Hindlll sites, 
and a 5 ' -protruding ss region having Sail site. 

oligonucleotide 5 (SEQ ID NO: 10) : 
5 ' - CTCGAGAAGCTTACG -3' 
25 oligonucleotide 6 (SEQ ID NO: 11) : 
5 ' - TCGACGTAAGCTTCTCGAGTCTCTTCCC- 3 ' 

Example 1 : Construction of DNA Cassette 

30 The basic DNA unit obtained in Preparation Example 1 

was self-ligated at 16 °C for 2 hours to obtain a DNA 
cassette comprised of tandem repeats of the DNA unit. 

The DNA cassette was ligated with the left and right 
adaptors obtained in Preparation Examples 2 and 3, 

35 respectively, using T4 DNA ligase at 16 °C for 2 hours to 
obtain a DNA cassette. The DNA cassette was inserted in the 
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EcoRI/Sall sites of plasmid pUC19 (Yanish-Peron, C. et al., 
Gene , 33 , 103-119(1985)). coli JM109 was transformed 

• with the plasmid. 

The transformant JL. coli was cultured in LB 
5 medium (bacto-yeast extract 5 g, bacto-tryptone 10 g, NaCl 5 
. g and agar 15 g(pH7.2) per t) containing 50 /xg/ml ampicillin 
at 37 °C overnight. Plasmid DNA was purified from the 
culture and cleaved with EcoRI and Sail. DNA fragments thus 
obtained were subjected to electrophoresis on 3% metaphor 
10 agarose gel(FMC Co., USA). 

Fig. 5 reproduces the result of electrophoresis wherein 
lane 1 represents a 47 bp DNA fragment which is obtained by 
ligating the left and right adaptors; lane 2, a 86 bp DNA 
cassette which results from the ligation of the left 
15 adaptor, the basic DNA unit and the right adaptor; lane 3, 
a 174 bp DNA cassette which is the product of ligating the 
left adaptor, three basic DNA units and the right adaptor; 
and lane 4, a lkb ladder DNA molecular size markers. 

The plasmid containing 174 bp DNA cassette was 
20 designated pUC19-cGnRH-II ' -3R. 

Example 2 : Construction of Expression Vector containing 
DNA Cassette 

25 (Step 1) Insertion of a DNA cassette into an expression 
vector 

Plasmid pUC19-cGnRH-II' -3R obtained in Example 1 was 
cleaved with BamHI and Hindi 1 1 and the DNA cassette thus 
30 obtained was inserted in the BamHI/Hindlll sites of 
expression vector pRSET-B (Invitrogen Co., USA) to obtain 
vector pRSET-B-cGnRH-II' -3R. 



35 



(Step 2) Addition of a termination codon to a DNA cassette 
To add a termination codon to the DNA cassette, a DNA 
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fragment containing a termination codon, which is shown in 
Fig. 6, was prepared by annealing the following self- 
complementary oligonucleotide 7 (SEQ ID NO: 12), which 
provides a termination codon (TGA), an Ndel site and an Xmal 
5 site, and the DNA fragment thus obtained was inserted in the 
Hindlll site of vector pRSET-B-cGnRH-II' -3R obtained in Step 
1, to obtain vector pRSET-B-cGnRH-II ' -3R*. 

5 ' -AGCTTTGACAGCTGTCAA-3 ' 

10 

(Step 3) Addition of a modified basic DNA unit 

To increase the expression efficiency of the DNA 
cassette, the following complementary oligonucleotides were 

15 synthesized and annealed to obtain a synthetic DNA fragment 
shown in Fig. 7, which encodes methionine-linker peptide- 
cGnRH-II with a lower AG value for RNA secondary 
structure (computer program DNASIS™, Hitachi, Japan), and 
using the DNA fragment thus obtained the Ndel/Xmal fragment 

20 of vector pRSET-B-cGnRH-II' -3R* obtained in Step 2 was 
replaced, to obtain vector pRSET-B-cGnRH-II ' -4R*. 

oligonucleotide 8 (SEQ ID NO: 13) : 
5 ' - TATGGGCAAACGAGAGCATTGGTCACACGGGTGGTACC - 3 ' 
25 oligonucleotide 9 (SEQ ID NO: 14) : 

5 ' -CCCGGGTACCACCCGTGTGACCAATGCTCTCGTTTGCCCAT- 3 ' 

Fig. 8 describes the procedure for constructing vector 
pRSET-B-cGnRH-II' -4R* wherein C shows a basic DNA unit; AC, 
30 modified basic DNA unit; L, a left adaptor; R, a right 
adaptor; and S, a termination codon. 

The nucleotide sequence of the DNA cassette contained 
in vector pRSET-B-cGnRH-II' -4R* was determined using the 
dideoxy-mediated chain- termination method (Sanger, F. et al . , 
35 Proc. Natl. Acad. Sci. USA , 74, 5463-5467 (1977)). The 
resulting nucleotide sequence (SEQ ID NO: 15) of the DNA 
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cassette is shown below and details of the sequence are 
represented in Fig. 9 wherein t shows the trypsin-digestible 
site; and ***, the termination codon. 

5 5 ' -ATGGGCAAACGAGAGCATTGGTCACACGGGTGGTACCCCGGGGGGAAGAGAGAGC 
ACTGGTCCCACGGGTGGTACCCAGGCGGGAAGAGAGAGCACTGGTCCCACGGGTGGTA 
CCCAGGCGGGAAGAGAGAGCACTGGTCCCACGGGTGGTACCCAGGCGGGAAGAGACTC 
GAGAAGCTTTGACAGCTGTCAAAGCTT- 3 ' 

10 As can be seen in above nucleotide sequence and Fig. 9, 

the DNA cassette of vector pRSET-B-cGnRH-II ' -4R* 
consists of 183 bp nucleotides which contains four repeats 
of the nucleotide sequence encoding the repeating unit of 
(cGnRH- II -linker peptide) arranged in an open reading frame 

15 and a termination codon. 

IL_ coli TOP10F' (Invitrogen Co., USA) was transformed 
with vector pRSET-B-cGnRH-II' -4R* to obtain transformant E. 
coli cGnRH-II' -4R*. 

20 Example 3 : Expression of the DNA cassette 

Transformant E^ coli cGnRH-II' -4R* obtained in Example 
2 was cultured in SOB medium (tryp tone 20 g, yeast extract 
5.0 g, NaCl 0.5 g and KC1 1.86 mg per 1 t) containing 50 

25 /xg/ml ampicillin at 37 °C overnight. The resulting culture 
was diluted 200 fold with SOB medium and then incubated at 
37 °C. When the optical density at 600 nm reached 0.3, 
isopropyl-l-thio-S-D-galactoside was added thereto to a 
concentration of 1 mM. 5 pfu/cell of M13/T7 

30 phage (Invitrogen Co., USA) was added thereto, and the phage 
infected culture was incubated for 5 hours to express the 
DNA cassette. 

1 ml of the culture was centrifuged at 6,000 rpm for 5 
min to obtain a pellet and the pellet was suspended in 200 
35 lil of buffer(0.05M Tris-HCl, pH 6.8, 0.1M DTT, 2% SDS, 1% 
glycerol, and 0.1% bromophenol blue). The suspension was 
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heated at 90 °C for 5 min. and 10 y.1 thereof was subjected 
to sodium dodecylsulf ate polyacrylamide gel 
electrophoresis (SDS- PAGE) on 16% gel (Laemmli, U.K., Nature , 
227 , 680-685 (1970) ) . 
5 Fig. 10 reproduces the result of SDS-PAGE, wherein lane 

1 shows protein size markers; lane 2, proteins expressed in 
Ej_ coli transformed with vector pRSET-B-cGnRH-II' -3R*; lane 
3, proteins expressed in E^ coli transformed with vector 
pRSET-B-cGnRH-II' -4R*. As can be seen in Fig. 10, vector 
10 pRSET-B-cGnRH-II' -3R* expresses a low level of 9.2 kDa 
protein while vector pRSET-B-cGnRH-II' -4R* expresses a 
protein having a size of 8.0 kDa which is close to 7.3 kDa 
estimated for the DNA cassette shown in Fig. 9. 

15 Example 4 : Production of cGnRH-II analog 

« 

The culture obtained in Example 3 was centrifuged at 
2,000 x g for 20 min. at 4 °C to harvest E^ coli cells. The 
E. coli cells were washed twice with cold 20mM Tris-HCl 

20 buffer (pH 8.0) and suspended in 1/10 volume of the original 
culture. The resulting suspension was subjected to fast 
protein liquid chromatography. Proteins were adsorbed on a 
Mono Q 5/5 anion exchange column (Pharmacia LKB, Sweden) pre- 
equilibrated with 100% buffer A(20 mM Tris-HCl (pH 8.0) ) and 

25 eluted at a flow rate of 1 ml/min with a linear gradient 
generated using 100% buffer A and 100% buffer B(20mM Tris- 
HCl at pH 8.0 containing 1.0M NaCl) . Combined protein 
fractions were desalted by dialyzing against 20 mM Tris- 
HCl (pH 8.0) and lyophilized. The lyophilized powder was 

30 resuspended in 20 mM Tris-HCl (pH 8.0) and then digested with 
trypsin at room temperature for 3 hours. 

The digested protein was subjected to electrophoresis 
on Tricine gel having a concentration gradient from 10 to 
20%(Novex, USA) . The results are shown in Fig. 11, wherein 

35 lane 1 shows a 7.3 kDa protein; lanes 5, the peptide 
obtained by digesting the 7.3 kDa protein with trypsin; and 
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lane 6, trypsin. As can be seen in Fig. n, the 7.3 kDa 
protein is a tandem repetitive multimer of the 1.4 kDa 
monomer peptide. 

In order to examine the amino acid sequence of 1.4 kDa 
5 peptide, the 1.4 kDa peptide was transferred to a 
polyvinyl idene difluoride membrane (Mat sudaria, 1987) and the 
amino acid sequence was determined with a protein 
sequencer (Applied Biosystem, model 476A, USA) in a pulse- 
liquid mode. The resulting sequence (SEQ ID NO: 16) of the 
10 1.4 kDa peptide is shown below. 

Glu His Trp Ser His Gly Trp Tyr Pro Gly Gly Lys 

5 10 

15 As can be seen from the above amino acid sequence, the 

1.4 kDa peptide is a cGnRH-II analog consisting of cGnRH-II 
having the linker-derived Gly-Lys sequence attached to the 
C-terminus . 

20 While the invention has been described with respect to 

the above specific embodiments, it should be recognized that 
various modifications and changes may be made to the 
invention by those skilled in the art which also fall within 
the scope of the invention as defined by the appended 

25 claims. 



WO 00/17336 



PCT/KR99/00559 



- 16 - 

What is claimed is: 

1. A DNA cassette which comprises two or more tandem 
repeating units of a nucleotide sequence . encoding a 

5 biologically active peptide and a linker peptide attached 
thereto, the linker peptide being cleavable by a protease or 
a chemical agent. 

2. The DNA cassette of claim 1, wherein the 
10 biologically active peptide is comprised of 2 to 200 amino 

acid residues. 

3. The DNA cassette of claim 2, wherein the 
biologically active peptide is selected from the group 

15 consisting of adrenocorticotropic hormone, angiotensin, 
atrial natriuretic peptide, bradykinin, chemo tactic hormone, 
dynorphin, endorphin, enkephalin, fibronectin, fi-lipotropin 
fragments, gastrointestinal peptide, growth hormone 
releasing peptide, luteinizing hormone releasing hormone, 

20 melanocyte stimulating hormone, neurotensin, opioid peptide, 
oxytocin, vasopressin, vasotocin, parathyroid hormone, 
somatostatin, substance P and antigenic peptide. 

4. The DNA cassette of claim 3, wherein the 
25 biologically active peptide is chicken gonadotropin 

releasing hormone II. 

5. The DNA cassette of claim 4, which has a following 
nucleotide sequence(SEQ ID NO: 15): 

30 

5 ' -ATGGGCAAACGAGAGCATTGGTCACACGGGTGGTACCCCGGGGGGAAGAGAGAGC 
ACTGGTCCCACGGGTGGTACCCAGGCGGGAAGAGAGAGCACTGGTCCCACGGGTGGTA 
CCCAGGCGGGAAGAGAGAGCACTGGTCCCACGGGTGGTACCCAGGCGGGAAGAGACTC 
GAGAAGCTTTGACAGCTGTCAAAGCTT - 3 ' . 

35 

6. The DNA cassette of claim 1, wherein the linker 
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peptide cleavable by a protease has an amino acid link or an 
amino acid sequence selected from the group consisting of: 
(i) lysine and arginine, which are cleavable by 
trypsin; 

5 (ii) asparagine, cleavable by asparaginylendopeptidase; 

(iii) arginine, cleavable by arginylendopeptidase; 

(iv) lysine, cleavable by Achromobacter protease I; 

(v) glutamine, cleavable by Staphylococcus aureus V8 
10 protease; 

(vi) Glu-Asn-Leu-Tyr-Phe-Gln-Gly, cleavable by tTEV 
protease; and 

(vii) Ile-Glu-Gly-Arg, cleavable by factor Xa. 

15 7. The DNA cassette of claim 6, wherein the linker 

peptide is Gly-Lys-Arg. 

8. The DNA cassette of claim 1, wherein the linker 
peptide cleavable by a chemical atjent has an amino acid 

20 sequence containing methionine which is cleavable by 
cyanogen bromide. 

9 . A process for preparing a DNA cassette which 
comprises ligating a basic DNA unit having a 5' -cohesive 

25 ended form or 3' -cohesive ended form, the basic DNA unit 
composed of a double stranded DNA region encoding a 
biologically active peptide and two complementary single 
stranded DNA regions protruding from 5'- or 3' -ends thereof 
encoding a linker peptide cleavable by a protease or a 

30 chemical agent. 

10. A DNA cassette which comprises two or more tandem 
repeating units of a nucleotide sequence encoding a 
biologically active peptide having a C-terminal amino acid 

35 residue which is cleavable by a protease or a chemical 
agent . 
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11. The DNA cassette of claim 10, wherein the 
biologically active peptide is comprised of 2 to 200 amino 
acid* residues. 

5 12. The DNA cassette of claim 10, wherein the C- 

terminal amino acid residue is selected from the group 
consisting of: 

(i) lysine and arginine, which are cleavable by 
trypsin; 

10 (ii) asparagine, cleavable by asparaginylendopeptidase; 

(iii) arginine, cleavable by arginylendopeptidase; 

(iv) lysine, cleavable by Achromobacter protease T ? and 

(v) glutamine, cleavable by Staphylococcus aureus V8 
15 protease. 

13. The DNA cassette of claim 10, wherein the C- 
terminal amino acid residue is methionine which is cleavable 
by cyanogen bromide. 

20 

14 . An expression vector comprising the DNA cassette 
of any of claims 1 to 8 and 10 to 13 . 

15. The expression vector of claim 14, which is pRSET- 
25 B-cGnRH-II' -4R*. 

16. A microorganism transformed with the expression 
vector of claim 14 . 

30 17. The microorganism of claim 16, which is coli 

transformed with pRSET-B-cGnRH-II ' -4R*. 

18. A process for preparing a biologically- active 
peptide or an analog thereof which comprises the steps of: 
35 (a) preparing an expression vector comprising a DNA cassette 
containing two or more tandem repeating units of a 
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nucleotide sequence encoding a biologically active peptide 
and a linker peptide attached thereto, the linker peptide 
being cleavable by a protease or a chemical agent; 

(b) transforming a microorganism with the expression vector; 
5 (c) culturing the transformed microorganism to produce a 
multimeric peptide expressed by the DNA cassette; and 

(d) recovering and digesting the multimeric peptide with the 
protease or the chemical agent to obtain the biologically 
active peptide or an analog thereof carrying one or more 
10 amino acid residues originating from the linker peptide. 

19. A process for preparing a biologically active 
peptide which comprises the steps of: 

(a) preparing an expression vector comprising a DNA cassette 
15 containing two or more tandem repeating units of a 

nucleotide sequence encoding a biologically active peptide 
having a C- terminal amino acid residue which is cleavable by 
a protease or a chemical agent; 

(b) transforming a microorganism with the expression vector; 
20 (c) culturing the transformed microorganism to produce a 

multimeric peptide expressed by the DNA cassette; and 
(d) recovering and digesting the multimeric peptide with the 
protease or the chemical agent to obtain the biologically 
active peptide. 
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FIG.1 



5' -GAG CAC TGG TCC CAC GGG TGG TAC CCA GGC GGG AAG AGA-3' 
3-CCC HC TCT CTC GTG ACC AGG GTG CCC ACC ATG GGT CCG-5' 
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FIG. 3 



Nucleotide sequence 
EcoRI BaraHI Xmal/Smal encoding linker peptide 

A ATT CAA GGA TCC CCC GGG GGG AAG AGA 
GTT CTT AGG GGG CCC 



FIG. 4 



CTC GAG AAG CTT ACG 

CCC TTC TCT GAG CTC TTC GAA TGC AGC T 
Nucleotide sequence Xhol HindHI Sail 
complementary to that 
encoding linker peptide 
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FIG. 5 



12 3 4 




FIG. 6 



5'-AG CTT TGA CAG CTG TCA A-3* 

3'-A ACT GTC GAC AGT TTC GA-5' 



FIG. 7 



5'-T ATG GGC AAA CGA GAG CAT TGG TCA CAC GGG TGG TAC C -3' 
3'-TAC CCG TTT GTC CTC GTA ACC AGT GTG CCC ACC ATG GGC CC-5' 
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FIG. 8 



BamHI 



- C - C 



- C - R 



Hindlll 

il 



Small fragment of pUC19-cGnBH-II-3R 



T promoter 
f «U BamHI 

f pRSET-B r-Hindffl 




Insert a stop codon(***) synthetic DNA 



HindIII***PvuII Hindlll 
AG CTT TGA CAG CTG TCA A 

A ACT GTC GAC AGT TTC GA 




Replacing Nedl-Xmal fragment 



Met Gly Lys Arg Glu His Trp Ser His Gly Trp Tyr 
cGnRH] GGC AAA CGA GAG CAT TGG TCA CAC GGG TGG TAC C 

TAC CCG TTT GCT CTC GTA ACC AGT CCC ACC ATG GGG CC 

Xmal 

Mel-linker prptide-cGnRH-U portion based on AG value 



Nddl 



?-l v Hindlll 



Xhol Hindlll 
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FIG. 9 



Met- Gly-Lys j Arg t Glu-His-Trp-Ser-His-Gly-Trp-Tyr-Pro-Gly ( 14) 

(T) ATG GGC AAA CGA GAG CAT TGG TCA CAC GGG TGG TAC CCC GGG 42 
Ndel Xmal/Smal 



linker peptide 



cGnRH-11 peptide 



Gly-Lys j Arg i G lu-His-Trp-Ser-His-Gly-Trp-Tyr-Pro-Gly (27) 
GGA AAG AGA GAG CAC TGG TCC CAC GGG TGG TAC CCA GGC 81 



linker peptide 



cGnRH-11 peptide 



Gly-Lys t Arg t G lu-His-Trp-Ser-His-Gly-Trp-Tyr-Pro-Gly (40) 
GGA AAG AGA GAG CAC TGG TCC CAC GGG TGG TAC CCA GGC 120 



linker peptide] [ 



cGnRH-11 peptide 



Gly-Ly3 I Arg i G lu-His-Trp-Ser-His-Gly-Trp-Tyr-Pro-Gly (53) 
GGA AAG AGA GAG CAC TGG TCC CAC GGG TGG TAC CCA GGC 159 



linker peptide 



cGnRH-11 peptide 



Gly-Lys t ArgtL eu-Glu-LystLeu- *♦* (62) 

GGG AAG AGA CTC GAG AAG CTT TGA CAG CTG TCA AAG CTT 198 
Xhol HindlD Pvull Hindlll 



linker peptide 



stuffer amino acids 
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FIG. 1 0 




FIG. 1 1 

1 2 3 4 5 6 
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PCT/ISA/210 (patent family annex) (July 1998) 



